Thesaurus-Based Search in Large Heterogeneous Collections
Abstract
In cultural heritage, large virtual collections are coming into existence. Such collections contain heterogeneous sets of metadata and vocabulary concepts, originating from multiple sources. In the context of the E-Culture demonstrator, we have shown earlier that such virtual collections can be effectively explored with keyword search and semantic clustering. In this paper we describe the design rationale of ClioPatria, an open-source system which provides APIs for scalable semantic graph search. The use of ClioPatria’s search strategies is illustrated with a realistic use case: searching for “Picasso”. We discuss details of scalable graph search and the required OWL reasoning functionalities, and show why SPARQL queries are insufficient for solving the search problem.
Similar resources
A faceted search system for facilitating discovery-driven scientific activities: a use case from functional ecology
To address biodiversity issues in ecology and assess the consequences of ecosystem changes, large quantities of long-term observational data from multiple data sets need to be integrated and characterized in a unified way. During these last decades, functional trait-based approaches have shown great potential to facilitate the understanding and the prediction of ecosystem changes. To promote da...
Supporting Semantic Image Annotation and Search
In this article we discuss an application scenario for semantic annotation and search in a collection of art images. This application shows that background knowledge in the form of ontologies can be used to support indexing and search in image collections. The underlying ontologies are represented in RDF Schema and are based on existing data standards and knowledge corpora, such as the VRA Core...
Feasibility Study of a Plan to Compile a Women's and Family Studies Thesaurus Based on the BS ISO 25964-1 Standard
Research Objective: A feasibility study of the Family and Women’s Studies Thesaurus. Considering the expansion of information in the field of women and family studies, the wide span of related vocabulary, and the development of vocabulary lists and bibliographies, the Family and Women’s Studies Thesaurus can be a professional tool for indexing and retrieval of women’s information in data...
Improving Content Based Image Retrieval Systems with a Thesaurus for Shapes
Successful retrieval of relevant images from large-scale image collections is one of the current problems in the field of data management. In this paper, I propose a system which combines traditional CBIR techniques, such as feature extraction, with a thesaurus for objects / shapes. A brief presentation of the problem area is given, along with the basics of a prototype system, the VORTEX engine.
Faceted Access to Heterogeneous Cultural Heritage Collections using Semantic Web Techniques
Integrated digital access to multiple collections is a prominent issue for many Cultural Heritage institutions. The metadata describing diverse collections must be interoperable, which requires aligning the controlled vocabularies that are used to annotate objects in these collections. We demonstrate an interface prototype presenting two collections whose vocabularies have been matched applying...